home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
ftp.cs.arizona.edu
/
ftp.cs.arizona.edu.tar
/
ftp.cs.arizona.edu
/
icon
/
newsgrp
/
group98a.txt
/
000014_icon-group-sender _Thu Jan 22 10:29:33 1998.msg
< prev
next >
Wrap
Internet Message Format
|
2000-09-20
|
3KB
Return-Path: <icon-group-sender>
Received: from kingfisher.CS.Arizona.EDU (kingfisher.CS.Arizona.EDU [192.12.69.239])
by baskerville.CS.Arizona.EDU (8.8.7/8.8.7) with SMTP id KAA03860
for <icon-group-addresses@baskerville.CS.Arizona.EDU>; Thu, 22 Jan 1998 10:29:30 -0700 (MST)
Received: by kingfisher.CS.Arizona.EDU (5.65v4.0/1.1.8.2/08Nov94-0446PM)
id AA00533; Thu, 22 Jan 1998 10:29:29 -0700
To: icon-group@optima.CS.Arizona.EDU
Date: Thu, 22 Jan 1998 10:27:52 +0100
From: Anders Holtsberg <andersh@maths.lth.se>
Message-Id: <34C71118.299B@maths.lth.se>
Organization: Lund University
Sender: icon-group-request@optima.CS.Arizona.EDU
References: <6a596h$gt3$1@gte2.gte.net>
Subject: Re: Shannon-theoretic Language Approximators
Errors-To: icon-group-errors@optima.CS.Arizona.EDU
Status: RO
Content-Length: 1710
MJE wrote:
> I am wondering whether anyone has written a random-text generator in the Icon
> language of the sort that is described in the book "An Introduction to
> Information Theory : Symbols, Signals and Noise" by John Robinson Pierce
> (paperback; @ US$7.16 from http://www.amazon.com). [...]
If you like that you should buy a copy of Charniak's book Statistical
language
processing. It is about that and about statistical parsers and more. It
is
short and clear and cheap paperback and just great. I read it twice.
And I have a chart parser up and running in Icon that I will give to
others
some day but I must debug and document and add statisical parts and
unification
and output formatting and fancy graphics in my spare time first ...
> MORE GENERALLY: I would be interested in any Icon implementations of language
> statistics. Examples: counting frequencies of characters in a block of text,
> counting word frequencies in a block of text, examining symmetries in poetry,
> computing estimated probabilities of particular sequences of characters.
>
No, probably because there are so many things people would like to
do that a package would have to be very general. Indeed so general as to
be
a full programming language. And there are such beasts really. The best
is
called - well, you just mentioned it - Icon.
> Thank you so very much,
>
> Mark Evans <evans@gte.net>
best wishes
-- Andy
=== Anders Holtsberg ============================ andersh@maths.lth.se
===
Department of Mathematical Statistics Phone +46 46 222 4953
Lund University Fax +46 46 222 4623
=== Box 118, S-221 00 LUND, Sweden === http://www.maths.lth.se/matstat
===